Picture for Jiangchao Yao

Jiangchao Yao

Focal Reward: Balanced Reinforcement Learning under Rubric-Based Rewards

Add code
May 26, 2026
Viaarxiv icon

Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory

Add code
May 19, 2026
Viaarxiv icon

CR^2: Cost-Aware Risk-Controlled Routing for Wireless Device-Edge LLM Inference

Add code
May 12, 2026
Viaarxiv icon

Deep Reprogramming Distillation for Medical Foundation Models

Add code
May 06, 2026
Viaarxiv icon

WISV: Wireless-Informed Semantic Verification for Distributed Speculative Decoding in Device-Edge LLM Inference

Add code
Apr 20, 2026
Viaarxiv icon

POINTS-Seeker: Towards Training a Multimodal Agentic Search Model from Scratch

Add code
Apr 15, 2026
Viaarxiv icon

A Sanity Check on Composed Image Retrieval

Add code
Apr 14, 2026
Viaarxiv icon

Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach

Add code
Apr 13, 2026
Viaarxiv icon

Predicting Neuromodulation Outcome for Parkinson's Disease with Generative Virtual Brain Model

Add code
Mar 31, 2026
Viaarxiv icon

GenMask: Adapting DiT for Segmentation via Direct Mask

Add code
Mar 25, 2026
Viaarxiv icon